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The crystal structure of recombinant wild-type green fluorescent protein {GFP) has been solved to a 
resolution of 1,6 A by multlwavelength anomalous dispersion phasing methods. The protein is In the 
shape of a cylinder, comprising 11 strands of 8-sheet with an n-helix inside and short hellcat segments 
on the ends of the cylinder. This mottf, with fl-stntcture on the outside and a-helix on the Inside, repre- 
sents a new protein fold, which we have named the 8-can. Two protomers pack closely together to form 
a dimer In the crystal. The fluorophores are protected inside the cylinders, and their structures are con- 
sistent with the formation of aromatic systems made up of Tyr** with reduction of Its Ca-CB bond cou- 
pled with oycllzatlon of the neighboring glycine and serine residues. The environment Inside the cylinder 
explains the effects of many existing mutants of QFP and suggests speclflo side chains that could be 
modified to change the spectral properties of QFP. Furthermore, the Identification of the dimer contacts 
may allow mutagenic control of the state of assembly of the protein. 
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Green fluorescent protein (GPP) is 8 spontaneously fluorescent 
protein isolated from coelenterates, such as the Pacific jellyfish, 
Aequorea victoria*. Its role is to transduce, by energy transfer, the 
blue chemtluminescence of another protein, aequorin, into green 
fluorescent light 1 . The molecular cloning of GFP cDNA' and the 
demonstration by Chalfie and colleagues* that GFP can be expressed 
as a functional transgene have opened exciting new avenues of 
investigation in cell, developmental, and molecular biology. 
Fluorescent GFP has been expressed in bacteria 4 , yeast 1 , slime 
mold', plants'- 1 , DrosophM, zebrafish", and in mammalian cells 1 " 1 . 
GFP can function as a protein tag, because It tolerates N-termtnal 
and C-terminal fusion to a broad variety of proteins many of which 
have been shown to retain native function**"-". When expressed in 
mammalian cells fluorescence from wild-type GFP is typically dis- 
tributed throughout the cytoplasm and nucleus, but excluded from 
the nucleolus and vesicular organelles' 1 . However, highly specific 
intracellular localization including the nucleus, mitochondria", 
secretory pathway", plasma membrane", and cytoskeleton' can be 
achieved via fusions both to whole proteins and Individual target- 
ing sequences. The enormous flexibility as a noninvasive marker in 
living cells allows for numerous other applications such as a cell 
lineage tracer, reporter of gene expression, and as a potential mea- 
sure of protein-protein interactions". 

GFP is composed of 238 amino acids. Its wild-type absorbance/ 
excitation peak is at 395 nm with a minor peak at 475 ran with 
extinction coefficients of roughly 30,000 and 7,000/M/cm, respec- 
tively". The emission peak is at 508 nm. Interestingly, excitation at 
395 nm leads to decrease over time of the 395 nm excitation peak 
and a, reciprocal increase in the 475 run excitation band". This pre- 
sumed photolsomerization effect is especially evident with irradia- 
tion of GFP by ultraviolet (UV) light. Analysis of a hexapeptide 
derived by proteolysis of purified GPP led to the prediction that 
the fluorophore originates from an internal Ser-Tyr-Giy sequence, 
which is post-translationally modified to a 4-(p-hydroxybenzyli- 
dene)-imidazolidin-5-one structure*'. Studies of recombinant GFP 



expression in Escherichia colt led to a proposed sequential mecha- 
nism initiated by a rapid cyclization between Ser 65 and Gly 67 to 
form an imtdazolin-5-one intermediate followed by a much slower 
(hours) rate-limiting oxygenation of the Tyr 66 side chain by O, 
(ref. 21). Combinatorial mutagenesis suggests that the Gly 67 is 
required for formation of the fluorophore". While no known 
cofactors or enzymatic components are required for this appar- 
ently autocatalytic process, It is rather thermosensitive with the 
yield of fluorescently active to total GFP protein decreasing at 
temperatures greater than 30°C". However, once produced, GFP is 
quite thermostable, 

Physical and chemical studies of purified GFP have identified 
several Important characteristics. It is very resistant to denatura- 
tion requiring treatment with 6 M guanidine hydrochloride at 
90*C or pH of <4.0 or >12.0. Partial to near total renaturation 
occurs within minutes following reversal of denaturing conditions 
by dialysis or neutralization". Circular dichroisrn predicts signifi- 
cant amounts of p-sheet structure that is subsequently lost on 
denaturatlon". Over a nondenaturing range of pH, increasing pH 
leads to a reduction in fluorescence by 395 nm excitation and an 
increased sensitivity to 475 nm excitation". Reduction of purified 
GFP by sodium dithionite results in a rapid loss of fluorescence 
that slowly recovers in the presence of room air. While insensitive 
to sulfhydryl reagents such as 2-mercaptoethanol, fluorescence is 
irreversibly eliminated by treatment with the sulfhydral reagent 
dithiobisnltrobenzoic add*. 

The availability of S, coli clones expressing GFP has led to exten- 
sive mutational analysis of GFP function, Truncation of more than 
seven amino adds from the C-terminus, or more than the N-terml- 
nal Met, led to total loss of fluorescence". All nonfiuorescent 
mutants also failed to exhibit absorption spectra characteristic of the 
intact fluorophore, implying a possible defect in post-translational 
processing. Screens of random and directed point mutations for 
changes in fluorescent behavior have uncovered a number of infor- 
mative amino add substitutions. Mutation of Tyr 66 in the fluo- 
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rophore to His results in a shift of the excitation maximum to the 
UV (383 nm) with emission now in the blue at 448 nm 
(ref. 21). A Tyr 66-Trp mutant is blue-shifted, albeit to a lesser 
degree. Both changes are associated with a severe weakening of 
fluorescence intensity compared with wild-type GFP. Mutation of 
Ser 65 to Thr, Ala, Cys, or Leu causes a loss of the 395-nm excitation 
peak with a major increase in blue excitation"*. When combined 
with Ser 65 mutants, mutations at other sites near the fluorophore 
such as Val 68-Leu and Ser 72-Ala can further enhance the intensity 
of green fluorescence produced by excitation at 488 nmfrefs. 22 and 
29). However, amino add substitutions significantly outside this 
region also affect the protein's spectral character. For example, Ser 
202-Phe and Thr 203 -He both cause the loss of excitation in the 475 
nm region with preservation of 395 nm excitation" lie 167-Thr 
results in a reversed ratio of 395 to 475 nm sensitivity", while Glu 
222-Gry is associated with the elimination of only the 395 nm excita- 
tion*. Another change, Val 163-Arg, not only enhances the magni- 
tude of the Ser 65-Thr mutant, but also increases the temperature 
tolerance for functional GFP expression". Molecular evolution tech- 
niques have been reported to improve GFP fluorescence". 
Unfortunately, a roster of substitutions associated with complete loss 
of function has not been published 

Because GFP in crystattum exhibits a fluorescence spectrum 
and life span nearly identical to that of GFP in aqueous solution" 
and fluorescence is not an Inherent property of the isolated fluo- 
rophore, the elucidation of its three-dimensional structure will 
help provide an explanation for the generation of fluorescence in 
the mature protein, as well as the mechanism of autocatalytic fluo- 
rophore formation. Furthermore, the development of fluorescent 
proteins with additional emission and excitation characteristics 
would dramatically expand their biological applications. Color 
vision is based on the fact that spectral properties of a common 
fluorophore, cu-retinal, are altered as a function of protein envi- 
ronment within red, blue, or green opsins". The GFP from the sea 
pansy, Renilla remformh, which exhibits a single major excitation 
peak at 498 nm, apparently uses an Identical core fluorophore to 
that of A. victoria GFP. These findings, taken together with the 
spectral changes exerted by substitutions in amino acids over 100 
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residues from the GPP fluorophore, suggest that a rational strategy 
to modify and expand the fluorescence behavior of GFP based on 
protein structure may be possible. We report the X-ray diffraction 
structure derived from a crystal of wild-type, recombinant A. vic- 
toria green fluorescent protein. 

Results 

The structure of GFP has been solved using seleniomethionyt-sub- 
stttuted protein and multiwavelength anomalous dispersion 
(MAD) phasing methods. The electron density maps produced by 
the MAD phasing were very clear, revealing a dimer comprised 
of two quite regular p-barrels with 1 i strands on the outside of 
cylinders (Figs, 1-3). These cylinders have a diameter of about 



Table 1. List of amino add side chains with close contacts 
(less than 5 A) to the fluorophore. 



Protein Fluorophore Distance 
residue atom residue atom (A) 


Arg96 


NHS 


Tyr 66 


O 


2.7 


Gin 34 


NE2 


Tyr66 


O 


3.0 


His 148 


ND1 


Tyr66 


OH 


3.3 


Gin 69 


CD 


Tyr66 


O 


3.4 


Glu 222 


0E2 


Tyr69 


CE2 


3.5 


Val 150 


CG2 


Tyr 68 


CE1 


3.6 


Phe165 


CE1 


Tyr66 


CD1 


3.6 


Thr 203 


CG2 


Tyr 66 


CE2 


3.6 


lie 167 


C01 


Tyr66 


OH 


3.7 


Thr 62 


CG2 


Tyr66 
Tyr68 


CQ 


3.7 


Tyr 145 


CE2 


OH 


3.7 


Ser 205 


OG 


Tyr 66 


CE2 


4.0 


Vai81 


CG1 


Tyr66 


CE2 


4.4 


Gin 163 
Val 63 


NE2 
CG2 


Tyr66 
Ser65 


GO 


4.8 
4.8 



The fluorophore (g defined as the seven atoms of the phenol of Tyr 66, the six 
atoms of the imtoazolldone, and the bridging methylene between the rings. 
The amino acid side chains listed here would be expected to have the most 
direct affects on fluorescence and perhaps fluorophore formation. The atom 
names are taken from the Brookhaven Protein Data Bank nomenclature, 
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Figure 3, A topology diagram of the folding pattern In GFP, The 
8-sheet strands are shown in light green, a-heHoee In blue, and con- 
necting loops In yellow, The positions In the sequence that begin and 
end each major secondary structure element are also given. The anti- 
parallel strands (except for the Interactions between stands 1 and 6} 
make a tightly formed barrel. 




Figure 4. Model of the fluorophore and Hs environment superimposed 
on the MAD-phased electron density map at 2.2 A resolution. The 
clear definition throughout the map allowed the chain to be traced 
and side chains to be well placed. The density for Ser 65, Tyr 66 and 
Gly 67 Is quite consistent wtth the dehydrotyroetne-lmldazolidone 
structure proposed for the ftuorophore. Many of the side chains adja- 
cent to the fluorophore are labeled. Figures 4 and 6 were produced 
wtth O (ref. 43). 



30 A and a length of about 40 A. Inspection of the density within 
the cylinders reveals modified tyrosine side chains as a part of an 
irregular a-helical segment (Fig. 4). Small sections of a helix also 
form caps on the ends of the cylinders. This motif, with a single 
a-helix inside a very uniform cylinder of JJ-sheet structure, repre- 
sents a new protein class, as it is not similar to any other known 
protein structure. 

The fluorophore is highly protected, located on the central helix 
within a couple of angstroms of the geometric center of the cylin- 
der. The pocket containing the fluorophore has a surprising num- 
ber of charged residues in the immediate environment (Fig. 5, 
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Table 1). The environment around the fluorophore includes both 
apolar and polar amino acid side chains. Phe 64 and Phe 46 are 
near the fluorophore and separate the single tryptophan, Trp 63 
from direct contact with fluorophore (closest distance of 13 A). A 
table of all atoms that come in contact with the fluorophore and 
their distances to the fluorophore is provided (Table I). 

The crystallographic contacts are all rather tenuous, consisting 
of a few amino acid side chains for each. The noncrystallographlc 
symmetry is maintained by extensive contacts and thus is likely to 
be the source of the dimerization seen in solution studies (Fig. 6). 
The dimer contacts are fairly tight and consist of a core of hydro- 
phobic side chains from Ala 206, Leu 221, and Phe 223 from each 
of the two monomers and a wealth of hydrophilic contacts (Fig. 6), 
including Tyr 39, Glu 142, Asn 144, Asn 146, Ser 147, Asn 149, 
Tyr 151, Arg 168, Asn 170, Glu 172, Tyr 200, Ser 202, Gin 204, and 
Ser 208. Contacts with other crystallographic molecules are not 
extensive, and the salt-dependence of this dimer interface and/or 
the loose contacts with neighboring molecules may explain the dif- 
ficulties with isomorphism in Initial heavy atom phasing studies. 

Mass spectrometry studies of the bacterially expressed wild- 
type and selenio-methionyl protein show masses of 26836.1 (±0.9) 
and 27069.3 (±1.4) g/mol, respectively. The masses calculated for 
the known TU#58 gene sequence, including the original inadver- 
tent Gin 80-Arg PGR error during the cloning of the gene for GFP 1 
and the cycluation and oxidation of the tyrosine are 26835.5 and 
27070.0 for the selenomethionine, respectively. The differences of 
0.6 and -0.7 g/mol are small and the results are therefore consis- 
tent with essentially complete fluorophore formation, Including 
the loss of water after cyclization. The error limits do not allow 
accurate determination of the degree of oxidation of the dehy- 
drotyrosine, however. These results indicate that the starting mate- 
rial for the crystallization was essentially fully formed GFP, and the 
lack of difference density in Fo-Fc maps in this region shows that 
the crystal contains fully cyclized GFP. 

Discussion 

The remarkable cylindrical fold of the protein seems ideally suited 
for its function. The strands of 0-sheet are tightly fitted to each 
other like staves in a barrel and form a regular pattern of hydrogen 
bonds. Together with the short a-helices and loops on the ends, 
the 'can' structure forms a single compact domain and does not 
have obvious clefts for easy access of diffusable ligands to the fluo- 
rophore. This fold, taken with the observation that the fluorophore 
is near the geometric center of the molecule, explains the observed 
protection of the fluorophore from collislonal quenching by 
oxygen (K^ < 0,004/M/sec) M and hence reduction of the quantum 
yield. Perhaps more seriously, photochemical damage by the for- 
mation of singlet oxygen through Intersystem crossing is reduced 
by the structure. The tightly constructed (3-can would appear to 
serve this role nicely, as well as provide overall stability and resis- 
tance to unfolding by heat and denaturants. 

The location of certain amino acid side chains in the vicinity of 
the fluorophore also begins to explain the fluorescence and the 
behavior of certain mutants of the protein. At least two resonant 
forms of the fluorophore can be drawn, one with a partial negative 
charge on the benzyl oxygen of Tyr 66, and one with the charge on 
the carbonyl oxygen of the tmidazolidone ring. Interestingly, basic 
residues appear to form hydrogen bonds with each of these oxygen 
atoms, His 148 with Tyr 66 and Gin 94, and Arg 96 with the imida- 
zolidone. These bases presumably act to stabilize and possibly 
further delocalize the charge on the ftuorophore. Most of the other 
polar residues in the pocket form an apparent hydrogen-bonding 
network on the side of Tyr 66 that requires abstraction of protons 
in the oxidation process. It is tempting to speculate that these 
residues help abstract the protons, As for the mutants, atoms in the 
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side chains of Thr 203, Glu 222, and lie 
167 are in van der Waals contact with Tyr 
66, so their mutation would have direct 
steric effects on the fluorophore and 
would also change its electrostatic envi- 
ronment if the charge were changed, as 
suggested*. A quantitative explanation 
will require further examination. It seems 
likely that other mutations of the residues 
near the fluorophore would also have 
effects on the absorption and/or emission 
spectra, and such experiments to change 
the electrostatic environment around the 
fluorophore are in progress. By virtue of 
their varied fluorophore environments 
and hence altered spectra, these mutants 
should lead to expanded uses of green flu- 
orescent protein as gene markers and cell 
lineage markers, and should encourage 
other uses in biotechnology. 

Mutations in regions of the sequence 
adjacent to the fluorophore, i.e. in the 
range of positions 65-67, have been sys- 
tematically explored"; some cause signifi- 
cant wavelength shifts and most cause a 
loss of fluorescence intensity. For exam- 
ple, mutation of the central iyr to Phe or 
His shifts the excitation bands, but there 
is an overall loss of intensity. Secondary 
mutations to compensate for the deleteri- 
ous Intensity effects should also now be 
possible. The Ser 65-Thr mutant is partic- 
ularly interesting because of its reported 
increase in fluorescence intensity 3 '"". The 
mechanism for increased fluorescence 
may be reduced coliisional quenching, 
because the additional methyl group may 
make for better packing in the interior of 
the protein. On the other hand, the effect 
has also been suggested to be through 
improved conversion of the tyrosine to 
dehydrotyrosine. However, the fact that 
we see significantly altered structure rela- 
tive to standard protein conformations in 
the wild-type argues against a dramatic 
increase in cyclization and/or oxidation, 
This effect is most likely produced by 
increased expression and/or folding of the 
protein. The crystal structure of the 
Ser 65-Thr mutant has also been solved", 
and it will be interesting to compare the 
two structures for clues about the fluores- 
cence and other differences. The report of 
improvements in GPP by DNA shuffling", 
comprising mutations Phe 99-Ser, Met 
153-Thr and Val 163-Ala are difficult to 
explain based on the structure. Positions 
153 and 163 are on the surface of the pro- 
tein and may exert their effects through 
improved solubility and/or reduced 
aggregation as proposed. The Phe-Ser 
mutation at first glance would appear to 
destabilize the core of the protein and 
thus we do not know how it would 
improve the system. 



Figure 6- Stereo view of the fluorophore and ft* environment Hie H 
seen on opposite ends of the fluorophore and probat' 
rophore. Charged, polar, and nonpotar aide chains &B ct 
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Tab!* 2, Anomalous diffraction differences and scattering factors for salens omethlonyl GFP at the four wavelengths used. 



30>d>3.4(A) 



&4>d>a,7(A) 



2.7>d>2.2(A) 



0,9879 0.9794 0,6792 0.9689 0.9878 0.9794 0.9792 0.9686 0.0879 0.9794 0.9792 0.9688 



0.9879 


0.026 0.042 


0.030 


0.020 


0.9794 


(0.026) 

0.050 


0.026 


0.045 




{0.029) 






0.9792 




0.067 


0.032 


0.9686 




(0.031) 


0.049 








(0,027) 



0.046 0.037 


0.032 0.064 0.071 
(0.052) 


0.068 


0.064 


-4.0 


1.1 


0.058 0.035 


0.049 0.088 


0.065 


0.077 


-10.5 


3.9 




(0.060) 










0.074 


0.040 


0.101 


0.071 


-7.9 


5.5 


(0.042) 


0,059 


(0.062) 


0.089 


-3.4 


3.9 



The mechanism of activation of the fluorophore from ordinary 
protein structure is consistent with a nonenzymatlc cydization 
mechanism like that of Asn-Gly deamidation 14 followed by oxidation 
of the tyrosine to dehydrotyrosine, as previously suggested. The role 
of molecular oxygen in this mechanism and in GFP fluorescence is 
paradoxical, however. Molecular oxygen is proposed to be needed 
for formation of the double bond between Cot and C(J on the tyro- 
sine to form an extended aromatic system, but oxygen must also be 
excluded from regular interactions with the fluorophore or else colli- 
slonal quenching of the fluorescence or damaging photochemistry 
will occur. The low bimolecutar quenching rate suggests that the 
protein's design sacrifices efficient fluorophore formation for stabil- 
ity and higher quantum yields once fully formed. 

The excited state dynamics of GFP have been studied using 
Stark, steady state, and time-resolved fluorescence spectroscopies" 
(and Youvan and Michel-Beyerle, personal communication), The 
results suggest that proton transfer is involved in interconversions 
within two grounded and two excited states. The extended set of 
polar interactions around the flourophore could easily accomodate 
proton rearrangements, with the most likely direct effects being 
between the His 148 with the hydroxyl of Tyr 66, Arg 96 interac- 
tions with the imidazolidone, or Glu 222 interactions with the 
hydroxyl of Ser 65. Since the Ser 65/Glu 222 mutants have both lost 
their native interactions together with their approximately 
400 nm absorption bands, one possibility is that the 400-nm band 
arises from the abstraction of the Ser 65 hydroxyl proton by 
Glu 222. This is speculation, but similar spectroscopic studies on 
mutants at these positions may be able to differentiate the roles of 
these sites in excited state dynamics. 

The N-termini and C-termini truncation studies" and the fluo- 
rescent fusion products*'"* are now understandable, given the struc- 
ture of the protein. Since the C-terminus loops back outside the 
cylinder and the last seven or so amino adds are disordered it should 
not be critical to have them present, and further addition would 
seem to be easily tolerated. These residues do not form a stave of the 
barrel. The role of the N-terminus is a little less clear, because the 
first strand in the barrel does not begin until amino acid 10 or 11 . 
Thus, barrel formation does not require the N-terminal region. The 
N-terminal segment is, however, an integral part of the 'cap* on one 
end of the protein and may be essential in folding events or in pro- 
tecting the fluorophore. Again, extensions at the N-terminus would 
not disrupt the motif structure of the protein. 

The chemical modification studies 3 * using sulfhydral reagents 
can be partially explained. Reaction of one of the cysteines near 
one end of the cylinder, Cys 70, would appear to disrupt the pack- 
ing of the cap on that end, and therefore allow quenching of the 
fluorophore. Significant fluorescence intensity effects by the modi- 



fication of Cys 48 on the exterior of the protein would not be 
expected, a priori. The structure determination of the dithionite- 
reduced, nonfluorescent species has not yet been studied, but 
should provide additional data on the nature of the fluorophore. 
The pH dependence of the excitation bands at 395 nm and 475 nm 
(ref. 25) is almost certainly due to His 1 48, whose N8 atom is 3.3 A 
from the Tyr 66 hydroxyl oxygen atom of the fluorophore, although 
nuclear magnetic resonance pKa measurements or mutagenesis 
studies would be needed for confirmation. 

The dimer we see as the asymmetric unit in the crystal is likely 
to be the same one formed in solution, because the ionic strength 
of the crystallization buffer is low, and we see dimcrs at low 
(<100 mM) ionic strengths in solution. Thus, it is not surprising to 
see the large number of hydrophilic dimer contacts, The smaller 
hydrophobic patch could conceivably be involved in physiological 
interactions with aequorin, as there would be a natural advantage 
to close proximity for efficient energy transfer. It is not known at 
present whether dimers form in physiological circumstances, or 
what the effect of dimerization is on energy transfer, aside from the 
circumstantial inferences on the excitation spectra previously 
reported on the native protein". The dimer contacts should now 
be able to be modified in such a way as to disrupt the formation 
of dimers without affecting stability and folding. Other nearby 
residues could also be converted to hydrophobic residues to 
enhance dimer stability if desired. Control of the dimerization will 
be important for fluorescence resonance energy transfer studies of 
protein-protein interactions using GFP, as one would not want 
to induce association and therefore resonance energy transfer 
between the differently colored GFP proteins by mechanisms other 
than that of the target protein interactions. Mutants may also be 
developed for reduction of aggregation during expression and 
hence fewer problems with inclusion bodies. 

The three-dimensional structure of GFP has provided a physico- 
chemical basis for many observed features of the protein, including 
its stability, protection of its fluorophore, behavior of mutants, and 
dimerization properties. The structure will also allow directed muta- 
tion studies to complement random and combinatorial approaches. 

Experimental protocol 

Purification. GFP was purified from & toli strain BL2l(DE3)pLys$ 
(Novagen, Madison, WI) containing plasmid TU#58, bearing the wild-type 
A. victoria green fluorescent protein*. For the selenomethionine protein, the 
plasmid was moved to E. teAi methionine auxotroph strain B834(DE3)pLysS 
(Novagen). The purification involved cell lysis, centrifugation of cell debris, 
and four column chromatography steps: DEAE anion exchange column 
(Sigma, St, Louis, MO, CL-6B) with a zero- to 1-M NaCl gradient in 10 mM 
phosphate, 2 mM EDTA, 2 mM DTT, pH 7; a hydrophobic interaction col- 
umn (Sigma, CL-4B) with a 0.1- to zero-M phosphate gradient in 2 mM 



1280 



NATURE BIOTECHNOLOGY VOLUME 14 OCTOBER 1996 



© 1 996 Nature Publishing Group http://www.nature.com/na1urefaiotechnology 

RESEARCH ARTICLE 



EDTA, 2 mM BIT, pH 7; an HPLC anion exchange column (Bio-Rad, 
Hercules, CA, Bio-Gel DEAE-SPW) with a zero- to t-M NaCl gradient in 
10 mM phosphate, 2 mM EDTA, 2 mM DTT, pH 7; and an HPLC gel fil- 
tration column (Bio-Rad, Bio-Gel SEC-125) with 0.1 M phosphate, 2 mM 
EDTA, 2 mM DTI, pH 7. Gel filtration columns run at 10 mM phosphate 
showed predominately a twofold higher molecular weight spedes. Matrix- 
assisted laser desorption Ionization mass spectrometry was performed by the 
University of Texas Health Sciences Center analytical chemistry service. 

Crystallization. The protein was crystallized in sitting drop vapor diffu- 
sion wells (Hampton Research) at room temperature using 58% 2-methyl- 
2,4-pentanediol (Atdrlch), 50 mM morpholino ethane sulfonic add, 0.1% 
sodium azide at pH 6.8, The protein concentration varied, but was typically 
20 to 30 mg/ml. Crystals grew as green fluorescent square bipyramids up to 
0.5 mm on a side. The space group was determined to be P4,2,2 or its enan- 
tiomorph, with a=b=87.15 A and c=i 19.85 A at cryogenic temperatures, and 
a=b=89.23 A and c=l 19.78 A at room temperature. The unit cell also varies 
with changes in ionic strength, and this effect thwarted solution by multiple 
isomorphous replacement. Packing density calculations suggested that there 
were probably two molecules per asymmetric unit. 

Multl wavelength anomalous dispersion. Data were taken at Brookhaven 
beam line X4A at four wavelengths. The wavelengths to be used were deter- 
mined by reference and crystal absorption scans. The data were taken at liq- 
uid nitrogen temperature using Inverse-beam geometry in wedges of 4° and 
processed using DENZO* Native and selenomethionine data sets were also 
taken in the laboratory on an R-AXIS 1IC detector with CuKct radiation. The 
native data set used in the refinement had an R«, p of 7.7% to 1.9 A resolu- 
tion (99+% complete in alf shells with fivefold average redundancy). 
Selenium atoms were located initially by standard difference Patterson maps 
between selenium-substituted and native protein using SHEI.X96 (ref. 39) 
and HEAVV (tef. 40) and confirmed by Patterson maps using the MAD data. 
MADSYS software" was used to give the anomalous diffraction differences 
shown in Table 2 and to extract Fa, Ft, and phase Information, The resulting 
MAD-phased map was solvent flattened and twofold averaged based on the 
selenium sites using CCP4 (ref. 42) and skeletonized using the program O 
(ref. 43)i it immediately revealed two 11-stranded cylindrical B-barrels. The 
polypeptide chain was traced for one of the barrels beginning from the sele- 
nomethionines and extending the structure In each direction, helped by the 
recognition of the modified tyrosine in the middle of the barrel asTyr 66, the 
nucleus of the fluorophore. The correct enantlomorph is space group P4,2,2, 
as confirmed by the handedness of the B-barrel and the ct-heliccs. 
Refinement has been started using the program X-P!or (ref. 44) using the 
native data collected at room temperature; the current R-factor at 1.9 A is 
0.21 with an R-ftee of 0.26, with good geometry (rms bond and angle devia- 
tions from ideality of 0.013 A and 1.8°, respectively) and tight restraint of 
the noncrystallographic symmetry. All measured data were included in the 
refinement. Coordinates and structure factors have been deposited at the 
Brookhaven Protein Data Bank under accession numbers 1GFL and 
R1GFISF, respectively. 
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